Content–based Coding of Videophone Sequences Using Automatic Face Detection
نویسنده
چکیده
A content–based coding scheme for the transmission of videophone sequences at very low bit rates conforming to the MPEG–4 standard is presented. The goal is to improve the image quality in the facial area of a person, at the expense of a lower quality in the remaining image, which is subjectively less important for the communication partner. In a first step, the face of the talking person is detected automatically. Then, each image is coded and transmitted as two different video object planes (VOP): the face VOP is formed by the facial area, the residual VOP by the remaining image. Thus, a large amount of the available bit rate can be used for coding the face VOP in good quality, while the residual VOP is coded at a lower quality. Using typical videophone sequences and compared to a standard scheme which codes the whole image at the same quality, the proposed scheme shows significant improvements of the image quality in the facial area.
منابع مشابه
Precise Face Model Adaptation for Semantic Coding of Videophone Sequences
In this contribution, an algorithm for the automatic adaptation of a 3D face model for semantic coding of videophone sequences at very low bit rates is presented. After automatic estimation of facial features from an image sequence, the face model is adapted to the eyes, mouth, eyebrows, nose and to the chin and cheek contours of the person’s face in the sequence. Applying the presented algorit...
متن کاملAutomatic Adaptation of a Human Face Model for Model-Based Coding
For coding of videophone sequences at very low bit rates, model-based coding is investigated. In a model-based coder, the human face in the videophone sequence is described by a three-dimensional (3D) face model. At the beginning of the videophone sequence, the model has to be adapted automatically to the shape, position and orientation of the real face present in the scene. In this paper, a ne...
متن کاملEye Activity Detection and Recognition Using Morphological Scale-Space Decomposition
Automatic recovery of eye gestures from image sequences is one of the important topics for face recognition and model-based coding of videophone sequences. Usually complicated models of the eye and its motion are used. In this contribution an eye gesture parameter estimation is described. A previously published automatic eye detection/tracking algorithm, based on template matching, is used for ...
متن کاملSynthesis of Facial Expressions for Semantic Coding of Videophone Sequences
In this contribution, a method for the synthesis of facial expressions for semantic coding of videophone sequences is presented. Firstly, a generic 3D face model is automatically adapted to the eyes, mouth, eyebrows, nose and chin and cheek contours of the individual face in the sequence. Secondly, a synthesis of facial expressions of this individual face is carried out using a model of the hum...
متن کاملA Complete Model–based Video Coder for Coding of Videophone Sequences at Very Low Bit Rates
In this paper, a model–based video coder for coding of head–and–shoulders videophone sequences at very low bit rates is presented. A source model is defined representing head and shoulders of the human person by 3D object components and the face by a pre–defined 3D face model. Each object component is described by 3D shape, 3D motion and texture parameters. Scaling parameters describe the adapt...
متن کامل